Harmonic filtering for joint estimation of pitch and voiced source with single-microphone input
نویسندگان
چکیده
Standard correlation based methods are not effective in estimating pitch tracks of multiple speech sources from a single-microphone input In this paper, an adaptive harmonic filtering is proposed to jointly estimate the source signals and their corresponding fundamental frequencies. By exploiting the harmonic structure of voiced speech, pitch information of one source is extracted from the pitch prediction filter and the output residual becomes the estimate of the other source. The procedure is iterated successively with a summation constraint. From the evolution of pitch prediction filter, it is shown that the iterative harmonic filtering with the summation constraint is effective to separate multiple pitch tracks into individual ones.
منابع مشابه
Approximate Kalman Filtering for the Harmonic plus Noise Model
We present a probabilistic description of the Harmonic plus Noise Model (HNM) for speech signals. This probabilistic formulation permits Maximum Likelihood (ML) parameter estimation and speech synthesis becomes a straightforward sampling from a distribution. It also permits development of a Kalman filter that tracks model parameters such as pitch, harmonic amplitudes, and autoregressive coeffic...
متن کاملA Novel Voicing Cut - off Det for Low Bit - Rate Harmonic
Generally, phonetic classification for low rate speech coding is restricted to either a simple binary voiced/unvoiced classification of entire speech frames, or alternatively, a more complicated estimation of the voicing for a set of frequency bands. A good compromise between these two techniques is estimation of a single cut-off frequency that separates the spectrum into voiced (below) and unv...
متن کاملSingle Microphone Blind Audio Source Separation Using Short+Long Term AR Modeling
In this paper, we consider the case of single microphone Blind speech separation. We exploit the joint model of speech signal (the voiced part) that consists on modeling the correlation of speech with a short term autoregressive process and its quasi-periodicity with a long term one. A linear state space model with unknown parameters is derived. The separation is achieved by estimating the stat...
متن کاملVoiced Speech Synthesis Using Pitch Asynchronous Code Excited Linear Filters for the Glottal Source
This paper proposes a model for natural quality voiced speech synthesis using code excited linear all-pole filter for modeling the glottal source signal. Classical glottal signal models are explicit-time functions which inhibit joint sourcetract parameter estimation and require pitch synchronous estimation with precise segmentation of open and closed glottis phase. These problems are overcome i...
متن کاملSingle channel speech separation in modulation frequency domain based on a novel pitch range estimation method
Computational Auditory Scene Analysis (CASA) has been the focus in recent literature for speech separation from monaural mixtures. The performance of current CASA systems on voiced speech separation strictly depends on the robustness of the algorithm used for pitch frequency estimation. We propose a new system that estimates pitch (frequency) range of a target utterance and separates voiced por...
متن کامل